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^unmary«  ^  Moat  of  the  well  known  significance  tests  and  confidence  intervals  for  the 

lif 

population  mean  are  based  on  the  assumption  of  a  random  sample,  Thie  paper  considers  how 
the  significance  levels  and  confidence  coefficients  of  a  commonly  used  class  of  these  tests 
and  intervals  are  changed  when  the  random  sample  requirement  is  violated  and  the  number  of 
observations  is  large.  It  is  found  that  the  introduction  of  even  a  slight  amount  of  correla¬ 
tion  can  result  in  a  substantial  significance  level  and  confidence  coefficient  change.  Thus 
this  class  of  tests  and  confidence  intervals  would  seem  to  be  of  questionable  practical 
value  for  large  sets  of  observations.  For  two  types  of  situations  of  practical  Interest, 
methods  are  outlined  for  obtaining  large  sample  tests  and  confidence  intervals  for  the  mean 
which  are  not  sensitive  to  the  presence  of  correlation.  These  results  are  as  efficient 
(asymptotically)  as  the  tests  and  intervals  they  replace  and  are  applicable  to  the  general 
situation  where  the  observations  are  not  from  the  same  population,  (  ) 

Introduction  and  Statement  of  Results,  In  deriving  statistical  tests  and  confidence 
intervals,  certain  assumptions  are  made,  V/hon  these  tests  ar<d  intervals  are  applied  to 
practical  situations,  their  validity  depends  upon  how  closely  the  assumptions  are  approxi¬ 
mated,  One  assumption  frecjuently  made  Is  that  a  set  of  observations  is  a  random  samplej 
i,e,,  that  the  observations  are 

(a) .  Statistically  independent, 

(b) .  From  the  same  ;x)pulalion  (i.e,,  the  observations  have  the  same  univariate 

distribution  function). 

One  of  the  principal  purposes  of  thia  paper  is  to  study  how  sensitive  a  conmonly  applied 
class  of  large  sample  tests  arxl  confidence  intervals  for  the  population  mean  is  to  violation 
of  assumption  (a). 

Let  the  n  observations  used  bo  from  populations  with  caramon  mean  p  while  the  values  of 
the  observations  are  denoted  by  x^,  •••,  x^.  The  class  of  tests  and  confidence  intervale 
for  p  Investigated  here  consists  of  ttiose  based  on  the  quantity 


(1) 


I 


where  x  ■  ^  x^^/n.  If  the  n  observations  are  a  random  sjnple  from  a  j)Opulation  for  which 
the  first  two  moments  exist  (almost  all  pofulations  approximated  in  practice  have  this 
property),  the  distribution  of  (1)  is  approximately  normal  with  zero  mean  and  unit  variance 
for  n  sufficiently  large.  The  cjuantity  (1)  ie  the  well  known  Student  t-statistic;  If  the 
observations  were  a  random  sample  frcn  a  normal  population,  this  quantity  would  have  a 
Student  t-dlstribution  with  n  -  1  degrees  of  freedom  (n  >  2), 

For  large  v^ilues  of  n,  it  is  found  that  the  introduction  of  an  average  correlation  as 
small  as  ,001,  or  sven  ,0001,  can  result  in  substanti;Q  changes  in  the  values  of  significance 
levels  and  confidence  coefficients  for  the  class  of  tests  and  intervals  based  on  (1),  As 
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av«ra£«  correlntlono  of  this  magnitude  uBU£Llly  defy  detection,  it  is  possible  that  such 
correlations  exist  in  many  practical  situations  where  a  handom  sample  is  assumed.  Thus  it 
would  seen  important  to  replace  this  class  by  other  tests  and  confidence  intervals  which 
are  not  sensitive  in  this  respect.  Replacement  results  are  derived  for  two  special  types 
of  situations i 

(i).  The  observations  can  be  divided  into  two  or  more  nearly  equal  subsets  such  that 
the  correlation  within  subeets  is  very  much  greater  (in  magnitude)  than  the 
correlation  among  subsets, 

(li).  The  order  in  which  the  observations  wore  drawn  is  known.  The  correlation  between 
nearby  observations  in  this  ordering  is  very  much  greater  (in  magnitude)  than  the 
correlation  between  distant  observations. 

As  an  example  of  (i),  consider  an  agricultural  experiment  where  part  of  the  observa¬ 
tions  cone  from  one  locality,  part  from  another  locality,  etc.  For  this  case  it  is  often 
permissible  to  asaiaM  approximate  independence  between  observations  from  different  localities 
and  noticeable  correlation  between  observations  from  the  sane  locality.  If  the  localities 
do  not  yield  approximately  the  same  number  of  observations,  subeets  satisfying  (i)  can 
frequently  be  obt fined  by  combining  localities. 

Now  let  us  consider  situation  (ii).  If  the  observations  arc  obtained  from  a  process 
which  produces  one  observation  at  a  time,  in  maqy  cases  it  Is  reasonable  to  assume  that  the 
correlation  between  two  observations  depends  on  their  location  in  the  time  ordered  sequence 
of  obaervationa.  Two  obseivations  distant  from  each  other  are  almost  independent  while 
observations  near  each  other  are  appreciably  correlated. 

Consider  case  (i)  and  let  the  n  observations  be  divided  into  r  approximately  independent 

subsets.  The  subsets  contain  about  the  earns  number  of  observations  and  the  s,  observatii-ns 

th  ^ 

of  the  k  subeet  are  given  the  special  notation 


x^(k),  x^(k),  •••,  x^  (k). 


(k  -  1,  r). 


Actually,  this  set  of  observations  la  a  subset  of  the  observations  x^,  x^  tmt  was  given 

an  additional  notational  representation  to  simplify  fjresentation  of  results.  Thus,  for  this 

case,  each  obeervation  has  two  representations,  one  as  an  x,  and  the  other  as  x.(k).  This 

J 

double  representation  should  result  in  no  confur.lon  since  the  only  place';  in  whlchi  the 

notation  ie  used  are  those  where  all  n  observations  ire  considered  r inult;ineou:;  ly  and  given 

/  no 

identical  treatment  (e.g,,  ^  x^^,  etc.).  The  tests  and  confidence  intervals  developed  for  p 


are  based  on  the  quantity 
Vn  (x  -  p) 


(2) 


where 


^  n  2  j.  k  ^ 

”  1  ’  B  ^  x,x, 

1  k-1  ur^v-l  ^  i  J 


A  .  n/[n{n  -  1)  '  ^  »k(»x  -  DJ  . 


B  -  1/n  -  A. 


ir  n  1.  Ure.  and  condition  (1)  holds,  th,  distribution  of  (.)  1,  approxlmtely  normal  with 
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zero  mean  and  variance  approximately  unity  for  most  aituatlona  of  practical  interest. 

Under  some  restrictions  which  are  not  of  practical  importance,  it  can  be  shown  that 
the  asymptotic  distribution  of  (2)  is  normal  with  zero  mean  and  unit  variance  when  the  n 
observations  are  a  random  sample.  Thus  the  results  based  on  (2)  have  the  same  efficiency 
(asymptotically)  as  the  corresponding  tests  and  confidence  intervals  based  on  (1)  for  the 
case  of  a  random  sample.  However,  the  results  based  on  (2)  ar^  also  valid  for  case  (1), 
Now  let  us  consider  case  (li).  Then  the  order  in  which  the  observations  were  drawn 
is  known.  Let  denote  the  value  of  the  first  observation  drawn,  the  value  of  the 


•econd  observation,  •••,  the  value  of  the  last  observation  drawn.  It  ie  supposed  that 


an  integer  value  m  is  known  (m  <  n/2  -  1)  euch  that  the  correlation  between  any  two 
observations  x^,  x^  con  be  neglected  if  |i  -  Jj  >  m.  Then  the  tests  and  confidence 


intervals  derived  for  p  are  based  on  the  quantity 


(3)  ^  {I  -  p)/^|i  2  ^  C  I  x^(x^^^  .  •••  ♦  ♦  D  x^Xj 


where 


C  -  -  2m  -  1), 


D  -  -(2m  ♦  l)/n(n  -  2m  -  1), 


In  this  expression,  appear  for  which  1  >  n;  by  definition, 

cases.  If  n  is  sufficiently  large  and  (li)  la  satisfied,  the  distribution  of  (3)  is  nearly 
normal  with  zero  mean  and  varlmce  approximately  equal  to  unity  for  most  situations  of 
practical  importance. 

For  the  :a3e  of  a  random  sample,  under  mild  restrictions  it  can  be  shown  that  the 
asymptotic  distribution  of  (3)  is  normal  with  zero  uean  and  unit  variance.  Hence  the 
results  based  on  (3)  furnish  the  same  amount  of  " ir.fomation”  (asymptotically)  as  the 
corresponding  results  based  on  (1)  for  the  case  of  a  random  sample. 

In  general,  the  v/tlue  chosen  for  m  should  not  be  too  snail;  oth-rvlse  the  neglected 
effect  of  a  Large  number  of  small  correl  .tlons  might  become  important.  On  the  other  hand, 
too  large  a  value  of  m  results  in  conputational  and  other  difficulties  for  the  solution 
presented,  A  rule  of  thumb  which  is  likely  suitable  for  most  sltUMtions  is  to  set  m  *  n/lOO 
for  situations  where  there  is  reason  to  believe  that  smaller  values  of  m  would  be  satisfactory'. 
What  is  meant  by  the  statement  "a  sufficiently  large  value  of  n"  as  us  id  in  this  section 
is  difficult  to  specify  in  general.  However,  n  >  1,000  would  seem  to  be  satisfactory  for 
many'  situations.  The  results  presented  apply  to  both  discrete  and  co’  tlnuous  variables, 
Investi/'ation  of  bpecified  Class.  Now  let  us  investigate  how  sen'-itive  the  tests  and 


confidence  intervals  based  on  (1)  and  the  assumption  of  a  random  sample  are  to  slight 

^  _ _  j _  __  2  •  .t  _  »  .j  ,  . 


violation  of  condition  (a).  Denote  the  variance  of  by  and  the  correlation  between 
x^  and  Xj  by  (i  /  J  -  1,  •••,  n).  Also  let  each  observation  have  the  same  expected 
value  p. 

Imposing  some  weak  restrictions  on  the  behavior  (ns  n  increases)  of  the  fourth  and  lower 
order  inoraents  (mixed  or  otherwise)  of  the  multivariate  poj-ulntion  from  which  x^,  •••,  x_ 


•.  x_ 
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i«  a  saapl*  valu«,  the  variance  of 


tonda  to  zero  as  n  approachea  infinity.  Thus,  since  the  expected  value  of  this  expression 
equal a 


"  ?  "  1^1 

it  follows  fron  Tchel^cheff 'a  Inequality  combined  with  the  convergence  theorem 
aaynp.  otic  ally  the  distribution  of 


(4) 


o^OjAn  -  1) 

^ij®i°j  J 


1 

2 


{x.^  -  x)V(n  -  1) 


IJ  that 


haa  zero  mean  and  unit  variance.  Hence,  for  large  n,  the  variance  of  (1)  differs  from  its 
hypothetical  value  of  unity  by  the  factor 


[l  .  (n  -  l)f]/{l  -  f), 

where 

?  -  ^  21  0^  ■ 

The  range  of  penaisaible  values  for  ^  is  -l/(n  -  l)tol.  If  the  have  the  same  value,  f 
representa  the  average  correlation  among  the  observations;  i.e,, 

^  ^ 

For  moat  situations  of  practical  interest,  the  distribution  of 


is  asymptotically  normal.  Thus,  using  Tchebycheff 'a  Inequality  and  the  convergence  theorem 
[ij,  for  these  situations  the  distribution  of  (4)  is  apj'roxinvitely  normal  with  zero  mean 
and  unit  variance  for  n  sufficiently  large.  Let  us  consider  a  class  of  one-sided  confidence 
inteinrala  for  p  obtained  by  use  of  (1)  under  the  assumption  of  a  random  sranple;  here  it  Is 
assumed  that  the  distribution  of  (A)  ie  opj  roximately  normal ,  Let  denote  the  standardized 
normal  deviate  (zero  mean,  unit  variance)  exceeded  with  probability  The  one-eided 
confidence  interval 

(5)  (x  ♦  (x^  -  x)V(n  -  1)  ,  oo) 

is  then  assumed  to  have  confidence  coefficient  €  ;  i.e,, 


Pr(x  ♦  V2I  “  x)^An  -  1)  -  m) 


6, 
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Actually, 


[x  ♦  {x^  -  x)V(n  -  1)  <  ^ij 

-  Prjj  ♦  VB-  -  i)f]X  (x^  -  x)Vn(n  -  1)(1  -  ^  <  ^l  • 


wh«re  a  is  defined  by  the  relation 

V(1  -  f)/[l  •  (n  -  l)f  ]  . 

If  the  obeervatlone  are  a  random  sample,  f  •  0  and  a  ■  6.  If  f  /  0,  howorer,  a  can 
differ  noticeably  from  6,  For  example,  let  €  -  ,05,  n  •  10,000  and  f  >  ,001.  Then 
a  >  .31.  If  ^  <  -.00008  for  this  case,  a  ^  .00012.  Thus  very  slight  deriallons  of  ^ 
from  zero  can  result  In  substantial  deviations  of  the  true  confidence  coefficient  of  (5) 
from  its  ^lypothetlcal  value.  Analo^^ous  considerations  apply  to  other  one-sided  and  two- 
sided  confidence  intervals  and  to  significance  tests  taaed  on  thene  confidence  Intervals, 

From  the  definition  of  f  and  the  analysis  of  this  section,  it  is  seen  that  only 
slight  average  correlations  need  be  introduced  to  cause  the  si;;nif icance  levels  and  con¬ 
fidence  coefficients  of  tests  and  intervals  Lasei  on  (i)  to  differ  substantially  from  t.-«ir 
assumed  values.  Thus  results  based  on  (1)  axid  the  ai.sumptlon  of  a  random  sample  are  very 
sensitive  to  slight  deviations  from  (a). 

Derivations  for  Case  (i).  Here  it  will  be  shown  that  the  asympio’ic  distribution  of 
(2)  is  normal  with  zero  awan  and  variance  approximately  equ  J  to  unity  if  (l)  and  certain 
minor  restrictions  are  ■atlsfled. 

For  the  purpose  of  the  analysis,  it  will  be  arsumed  that  s,  ■  s^  •  •••  ■  s  ,  TTae 
results  obtained  on  the  basis  of  this  assumption  should  not  be  appreciably  changed  if  these 
equalities  are  only  approximately  satisfied.  Using  the  restriction  on  the  Sj^,  ttie  expression 


,  n  T  r  n_ 

-  £  ♦  A  ^  >  x^(k)x^(k)  ♦  B  X 

1  k"l  u^-1  i7J*l 


x^x^ 


can  be  written  as  a  linear  combination  of  the  quantities 


S  (x.  -  x)^,  ^[x  (k)  -  x(k)]^,  (k  -  1,  •••,  r), 

1  u-1  ^ 


wtiere 


x(k)  -  ^  x^(k)/8j^. 

Thus  replacing  each  x  by  x  -  p  leaves  the  value  of  (6)  unchanged. 

Consider  the  expected  value  of  (6),  Subtracting  p  from  each  x  in  (6)  and  then  taking 
expected  values,  this  is  found  to  equal 

i  i  0^  .  A£  ^  [x^(K)  -  [x^(k)  ->']}•  B  ^ 

On  the  basis  of  condition  (1), 


p-n2 


-  6  - 

for  reasonable  t/peeof  bituationt.  Thus  the  expected*  value  of  {6)  is  approxinmtely  equal  to 

With  Bome  mild  restrictlona  on  how  the  fourth  and  lower  order  moments  of  the  multi¬ 
variate  popula*  ion  from  which  x^,  •••!  x^  was  drawn  behave  as  n  increases,  the  variance  of 
(6)  tends  to  zero  as  n  approaches  infinity.  Since  the  expected  value  of  (6)  is  near  the 
value  of  (7),  this  expected  value  ia  positive  for  almost  af\y  situa*  ion  of  practical  interest. 
Consequently,  it  follows  from  Tchebycheff ‘s  Inequality  that  Pr[(6)  <  o] — >■  0  as  n  — ►  00  , 

Frost  this  it  is  seen  that  the  expected  value  of  the  absolute  value  of  (6)  is  also  approxi¬ 
mately  equal  to  (7)  for  large  n. 

As  the  variance  of  (x  -  p)  has  the  value  (7),  the  properties  stated  for  (2)  follow 
from  Tchebycheff 's  Inequality  combined  with  the  convergence  theoram  [l]. 

Derivations  for  Case  (ii).  In  this  section  it  will  be  shown  that  the  asymptotic 
distribution  of  (3)  is  normal  with  zero  mean  and  variance  approximately  equal  to  unity  if 
(li)  and  certain  minor  restrictions  hold. 

The  expression 


(8) 


i  ^  .  C  i;  .....  Xj^)  .  D  x^ 


can  be  written  as  a  linear  combination  of  the  quantities 


n 


-x2 


^  (Xj  -  i)‘. 


X  [(X^  -  Xj,p' 


Consequently  replacing  each  x  by  x  -  p  leaves  the  value  of  (8)  unchanged. 

Let  us  consider  the  expected  value  of  (8),  Replacing  each  x  in  (8)  by  x  -  p,  this 
expected  value  is  found  to  be  approximately  equal  to  (7)  if 


(9) 


n 


2X  -  P)  ■  P) 


♦  •  •  • 


('‘l.m  -  “ 


1 


n 


*  1^1 


On  the  basis  of  condition  (ii)  and  the  relTlion  m  >  n/lOO,  it  appears  that  (9)  will  be 
satisfied  for  most  situations  of  practical  Importance. 

Imposing  some  weak  restrictions  on  the  behavior  (as  n  increaser)  of  the  fourth  and  lower 
order  moments  of  the  multivariate  population  from  which  x^,  x^  was  obtained,  the 

variance  of  (8)  tends  to  zero  as  n  tends  to  infinity,  Since  the  expected  value  of  (6)  is 
near  (7)»  this  expected  value  >d^  almost  always  be  po  itive  for  situations  approximated  in 
practice.  Hence,  from  Tchebycheff 's  Inei:uality,  Pr[(8)  <  O]  — >0  as  n-^oo.  Thus, 
asymptotically  the  expected  value  of  |(8)1  is  also  approximately  equal  to  (7). 

The  properties  stated  for  (3)  now  follow  from  a  combination  of  Tchebycheff ’s  Inequality 
and  the  convergence  theorem  [ ij  . 
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